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ABSTRACT 

Mobiles are nowadays the most relevant communication de- 
vices in terms of quantity and flexibility. Like in most M ANETs 
ad-hoc communication between two mobile phones requires 
mutual trust between the devices. A new way of establish- 
ing this trust conducts social trust from technically measur- 
able data (e.g., interaction logs). To explore the relation 
between social and technical trust, we conduct a large-scale 
survey with more than 217 Android users and analyze their 
anonymized call and message logs. We show that a reliable 
a priori trust value for a mobile system can be derived from 
common social communication metrics. 

Categories and Subject Descriptors 

C.2.0 [Computer-Comm. Networks]: General — Secu- 
rity and protection (e.g., firewalls) 

General Terms 

Smartphone measurement, communication logs, security 

1. INTRODUCTION 

The ever increasing distribution of smartphones creates 
new networking possibilities. Equipped with several net- 
work interfaces the mobiles can form spontaneous ad hoc 
networks (e.g., to bypass bad links). Connecting to another 
device as sender, receiver, or relay for data packets, always 
requires trust towards the peer, as reciprocal dependencies 
arise concerning data privacy, service availability, and incen- 
tives for investing energy. Determining the trustworthiness 
of nearby devices is an important but also challenging re- 
quirement for ad hoc communication as central instances or 
pre-shared secrets are not feasible. 



New security approaches 13 12 arise that try to apply 
social trust of the phones' users to derive technical trust 
between devices. Bridging the gap between such social trust 
and measurable data on the devices is still an open research 
topic. On the one hand, complete logs and directories stored 
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at the mobile are required to explore the data space. On the 
other hand, a detailed understanding of appropriate metrics 
for trust calculation is bound to the direct interaction with 
the end user (i.e., questionnaires) at least for verification. 

In this paper, we conduct a large-scale survey and mea- 
surement with over 217 Android users to identify common 
communication parameters that are available at mobiles and 
that can indicate trust relationships between individuals. 
The contributions of this paper are the following: (1) We 
analyze more than 22,365 calls and 84,325 messages of a 
globally distributed set of subjects with respect to the gen- 
eral communication behaviour and trust establishment; (2) 
we map social trust on technical trust; (3) we evaluate trust 
prediction between smartphones. From a network perspec- 
tive, we use the (maybe unusual) methodology of a survey. 
However, we explicitly note that the ranking of communi- 
cation patterns according to trust requires a feedback from 
the users. 

Current studies either use a provider view on mobile phone 
communication but do not provide user specific data com- 
prising hints about inter-personal relationships. User-centric 
studies can provide those insights but include only very lim- 
ited measurements at the moment. 

The remainder of the paper is structured as follows: In 
Section [2] we describe and analyze the problem space and 
discuss other studies using mobile phone contact and com- 
munication data. The design and implementation of our 
survey are presented in Section [3|w ith a subsequent analysis 
of its results in section|4] Section|5]briefly describes applica- 
tion perspectives for the results obtained in this study. We 
conclude with an outlook in Section [6] 

2. PROBLEM STATEMENT & RELATED 
WORK 

With his seminal work, Granovetter [3] established tie 
strength as a measure for the relationships between indi- 
viduals in their social networks. On this basis he showed 
the transitivity of trust, i.e., if two individuals A and B are 
strong ties (e.g., good friends, family, etc.) with person C, 
there also exists a tie between A and B directly. Transitive 
trust provides the basis for a new class of communication 
protocols for mobile phones. Especially in the context of 
phones forming mobile ad-hoc networks (M ANETs), ad-hoc 
trust establishment is needed. When comparing contact list 
data trust can transitively be assigned in a privacy-friendly 
way |12| . Friends of friends detection in mobile networks 
also relies on transitive trust [l3]. Those protocols work 
with the contact list data or communication logs available 



on every mobile phone. In order to exploit this feature the 
tie strength towards other individuals has to be determined. 
Closeness or emotional intensity of a relationship are good 
indicators for strong ties FF] but they are not directly de- 
fined in observable data. On the other hand, duration and 
frequency of communication describe explicitly social inter- 
action and can be discovered from call logs. Closeness, dura- 
tion, and frequency all are positively related to an intimacy 
component of communication [6], combining all aspects in- 
creases the confidence in strong ties. While many of the 
previous studies were performed before the era of widely 
distributed mobile phones recent data is needed to create a 
mapping between technical measurable data (e.g., call and 
message logs) and the associated tie strength or trust at- 
tribute. 

Current studies that could be of use to associate techni- 
cal data of mobile phones with social trust of the phones' 
users are either provider centri c [11||9] or surveyed locally on 
the mobile phones themselves |2| |10| . Operator-based mea- 
surements are based on extensive data but rely on access 
to centrally observed information, which cannot be assumed 
in a deployed ad hoc scenario. Furthermore, they either 
provide only call records of intra-provider communication 
9] or those collected at designated base station controllers 
11] containing only the communication activities that took 
place in geographic vicinity to those switches. In addition, 
data collected at the provider level lacks locally available 
meta data such as tags assigned by the user to contact en- 
tries (e.g., favorites, home or work numbers). These tags 
can provide valuable information in tie strength analysis. 
Local, mobile centric data collection can provide such a rich 
depth of information and generally comprises a complete set 
of communication data of the specific user, not restricted 
by measuring points. The current data sets from mobile 
phones, however, contain only data of at most 100 partici- 
pants and analyze only a limited set of metrics [2]. 

To gain the information needed for protocols using the 
transitivity of trust, all call and message logs available on 
a mobile phone should be comprised. In order to come to 
a conclusion regarding tie strength or trust values towards 
peers, the participants of the study should be able to indi- 
vidually rate their contacts. 

3. EXPLORING SOCIAL TRUST AT MO- 
BILES 

We conducted a quantitative survey to explore the rela- 
tionship between measurable features on mobile phones and 
closeness or trust assigned to corresponding communication 
partners. Analyzing the answers of the subjects in combi- 
nation with gathered local data at the mobiles allows us to 
predict closeness and trust. Our empirical study further ex- 
amines how closeness and trust correlate in a mobile phone's 
context. 

3.1 Basic Survey Setup 

To overcome the problems that arise from pre-defined data 
or biasing environments, we implemented the survey as an 
application for smartphones. The app has direct access to 
the local communication data and thus allows to study to 
which extent data is available on real mobiles and how they 
describe trust and closeness. For selected entries in their 
contact list, participants were asked to respond to each of 




partners : 

- index_in_rating_li s t : 3 
rat ing_trust_in formation : 
rating_trust_best : 1 
rat ing_closeness : 2 
starred: false 
human : true 
cal Is : 

- call_type: outgoing 
duration: 4 

nuinbe rtype : mobile 
relative_date: 2459441 

- call_type: incoming 
duration: 144 
number_type: work 
relative_date: 1552495 

message s : 

- length: 206 
message_type : outgoing 
number_type: mobile 
relative date: 136912.00' 



(a) Rating screen 



(b) Sample 
YAML format 



result in 



Figure 1: Survey app to rate contact list entries 



the following statements: 

Closeness: I feel close to this person. 

Trust information: / would trust this person with sensi- 
tive information. 

Trust best: / trust that this person wants the best for me. 

The first item evaluates closeness, a common measure in 
relationship analysis (cf., Section Bb. The two subsequent 
statements aim to evaluate two characteristics of trust: The 
trust regarding sensitive information is to be given, a very 
important property in the context of ad hoc networks, and 
from a more general perspective the overall trust between 
two people. The responses range from strongly disagree 
(rating 1) to strongly agree (rating 5) on a five point Lik- 
ert scale [2] . Combining all three statements reveals if close- 
ness to an individual is related to trust. 

Conducting a large-scale survey is non-trivial, in partic- 
ular when sensitive information such as real contact data 
will be used. To successfully recruit participants, the survey 
(app) should preserve privacy and operate transparently, as 
well as give motivation to participate. 

3.1.1 Survey Application 

We developed the survey app for Google Android because 
it allows the installation of applications outside the Android 
Market and reaches a variety of people due to the wide range 
of models and device prices. 

The survey application first shows a welcome screen while 
analyzing the call and message logs of the phone at the same 
time. All contacts that appear in those logs are denoted 
(communication) partners and are collected and ranked based 
on the number of interactions with them (i.e., combined 
number of calls and messages). On the next screens (cf., 
Figure 1(a) I the participant will be asked to rate 20 dif- 
ferent partners. The partners prompted are alternately as- 
signed from three classes: contacts that (a) have the most 
interactions with the participant, (b) have the least interac- 
tions (but not zero), and (c) are randomly selected from the 
list of partners. For every contact, the subject should give 
a response to each of the three closeness/trust statements. 
Limiting the number of contacts to rank is important to 
not overload participants with questions. Our measurements 



show that some people have more than 160 communication 
partners. In addition, a pure random selection on a limited 
list may lead to convergence problems, which we avoid with 
our selection strategy. 

To exclude common business contacts in the survey, par- 
ticipants have been asked to use the button labeled This 
is not a person, which skips the current contact. After all 
20 contacts are rated the phone's email client presents the 
results in a message that will be sent. 

Respecting the participants' privacy and acting transpar- 
ently was a huge concern when developing the app. As a 
consequence the results have been made anonymous. Trans- 
parency has been implemented by sending results via email 
and not by an upload in the background that can not be 
observed and controlled by the user. The participants can 
decide whether or not to send the email. Furthermore, the 



results have been encoded in YAML (cf., Figure 1(b) I, a for- 
mat easily readable by humans. A beta phase revealed that 
XML was hard to understand by the subjects. In addition, 
Yahoo's email client on Android removes all XML tags from 
the email leaving it unusable for further processing. 

3.1.2 Acquired Data 

The results of the questionnaire consist of two parts: (a) 
general statistics and (b) details about all communication 
partners. The general section includes the number of en- 
tries in the address book, the amount of calls and messages 
in the corresponding logs, and the number of active com- 
munication partners. The list of communication partners 
contains meta data for every entry (i.e., if applicable, posi- 
tion in the questionnaire and rating; human state; favorite 
state in the address book), as well as a list of all calls and 
messages recorded in the logs. For every call to or from this 
communication partner the relative date, its type (i.e., in- 
coming, outgoing, or missed), the duration in seconds, and 
potential personal tags (e.g., home or work) are stored (cf., 
Figure 1(b) I. The date is concealed for privacy reasons. It 



represents the number of seconds between the time of the 
call and the time the survey app is executed extended by a 
random offset that is locally created for every participant. 
Analogously the listing of messages contains the message 
type, the tag assigned to the number, the relative date, and 
the length of the message in characters. 

3.1.3 Recruiting via Amazon Mechanical Turk 

To reach a greater number of participants and to avoid a 
bias by asking only personal acquaintances, we published the 
survey on the crowd-sourcing platform Amazon Mechanical 
Turk (AMT). The concept behind it is simple: A requester 
offers a Human Intelligence Task (HIT) that will be claimed 
by globally distributed workers, which get a compensation 
for a successful HIT. A single HIT can be handled by mul- 
tiple people. AMT is well-established in the scientific com- 
munity, in particular in the context of social science [I], but 
it gets popular as an appropriate tool for computer scientist, 
as well [5]. A major concern of such an approach might be 
the trust in the reliability of the (paid) answers. However, 
recent studies showed that survey results by AMT workers 
are consistent in quality compared to classic studies [l]. 

Our survey requires workers to download and install the 
Android app, perform the survey, and send the results in 
an email combined with their worker ID, so the task could 
be approved via the AMT platform. We calculate $1 per 



worker, which is slightly above-average but participants need 
to install the app. It is worth noting, that an overpriced task 
may reduce the result quality [8]. 

3.2 Verifying Quality of Answers 

We started our survey in December 2011. In total, we 
received 257 results after 6 weeks. Some workers completed 
the survey more than once, especially when they forgot to 
transmit their worker ID in the first email. This led to 37 
"double" submissions, which have not been considered in 
the analysis to avoid statistical weights for dedicated par- 
ticipants. Overall our subsequent analysis is based on 220 
answers. 

On the AMT webpage where workers claim the HIT, we 
allowed participants to share their opinions and thoughts. 
Surprisingly we received a lot of feedback from several users, 
which helped us to improve the survey and to verify the seri- 
ousness of the answers. At the beginning, a typical feedback 
was such as "There should be an option to discart some of 
the contacts. For example, the app asked me about a pizza 
place that i had in contact list. . . it doesn't make sense to 
answer the questions in this case. ". We already provided 
such a function but thereupon increased its visibility. We 
then asked the concerned participants to repeat the survey, 
which all of them did. In the end, we could not identify a 
significant difference in the number of non-personal contacts 
before and after we applied the small change in the appli- 
cation. This indicates the statistical insignificance of those 
events. 

Overall, the feedback of the participants gives us confi- 
dence that they took the survey seriously and did not ran- 
domly complete the form. Amongst others, this behaviour 
is motivated by a higher reputation for future HITs m. On 
the other hand, three out of 220 participants (1.3%) skipped 
through the contacts without giving a single rating. Those 
results were excluded from further investigation. 

As a site observation we noted that participants have only 
very limited privacy concerns. Only positive comments on 
this topic ("ya survey was very nice and did not hurt me in 
any way. ") have been raised and only a very few downloads 
of the source code have been initiated. 

3.3 General Observations 

After excluding all double and invalid submissions, we 
consider the data of 217 participants, which overall includes 
information about 7,114 communication partners that per- 
formed 22,365 calls and wrote and received 84,325 messages. 
For 3,449 contacts (48,4%) we received personal ratings of 
trust and closeness. The size of the contact list ranges per 
participant from 2 up to 1,827 entries with a mean value of 
249.3. The standard deviation of 308.7 indicates high fluc- 
tuation. However, the address book may include outdated 
entries. On average 32 different communication partners ap- 
pear in the call and message logs that show active parties, 
and an average of 304.6 calls and 739.2 messages have been 
handled. 

A requirement for this study is the collection of a repre- 
sentative observation period to reflect common mobile usage. 
Datasets that include only logs spanning short time periods 
may skew the results in particular if they contain very lim- 
ited communication partners. The typical interaction log 
covers on average 3 months of activities. Similar to the con- 
tact list sizes, the sample intervals are widely spread with 
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Figure 2: Average amount of communication depending on the rating of trust and closeness (1 = complete 
disagree, . . . , 5 = strong agree) 



a standard deviation up to 135.9 days. Around 88% of the 
logs provide data for more than one week and 70% reveal in- 
teraction for more than one month. Six participants exhibit 
very short-dated logs (<1 day). After asking the owners we 
can ensure that this is either coincident ally due to a reset 
of the mobile shortly before the survey started or a recent 
purchase. Usually, these cases exhibit only few communi- 
cation partners. However, some logs show a high amount 
of activity during a short period. One result, for example, 
documents 23 different communication partners in less than 
24 hours. To minimize skewed results, we decided to exclude 
logs that last less than one week and contain less than 10 
communication partners as well as those logs that contain no 
more than 4 different communication partners. The results 
of the 29 affected participants were excluded from further 
analysis. 

For the analysis performed in the following sections we 
take as a basis the survey results cleared from unreliable or 
double submissions. As described above we also filtered out 
results that showed typical patterns for phones that were 
recently reset or put into operation. Thus, in the following 
sections we work with the reliable and significant results of 
188 participants providing information about 22,157 calls 
and 83,941 messages from 6,992 communication partners of 
which 3,331 have been rated. 

4. SURVEY EVALUATION 

The results obtained from the Android survey with 188 
significant data sets can be evaluated in different ways. To 
answer the question if and how it is possible to predict close- 
ness or trust based on available activity logs we identify ap- 
propriate indicators using absolute as well as relative mea- 
surements. We further incorporate manual tags by the mo- 
bile phone's user. 

4.1 Absolute indicators 

In general, the level of trust and closeness is independent 
of the type of communication (voice vs. text) but depends 
significantly on the amount of communication (cf., Figure[2|. 
The top most communication partners are ranked with the 
highest trust value. On average « 8 calls and w 40 mes- 
sages have been exchanged by our surveyed users with very 
trustworthy or close parties. 

To analyze the relationship between trust and closeness 
in more detail, first, we calculate pairwise correlations. As 
shown in Table [l] all correlations are high and statistically 
significant. 

As correlations only have limited validity in the analysis 



of categorical data beyond that, we also conduct a multi- 
ple correspondence analysis (MCA). In a nutshell, MCA is 
an exploratory technique which serves to find common di- 
mensions within a group of variables. Figure [3] shows the 
results for our three survey questions. The first dimension 
reflects the closeness/trust-dimension, with categories 1 to 
5 ordered along the x-axis. Note that categories 1 and 5 
are further away from categories 2 to 4, which lie relatively 
closer to each other. The second dimension reflects the re- 
sponse behavior of the participants, ranging from moderate 
to extreme. Such a dimension is common among data col- 
lected by means of a Likert scale. Most importantly, the 
results show high similarity between each of the categories 
of the three questions, which, together with the high cor- 
relations between the variables indicates that closeness and 
trust towards a person are highly interchangeably. Thus, 
tie strength and closeness (cf., Section [5| can be transferred 
into trust regarding sensitive information and overall good- 
will. Throughout our subsequent analysis, we will only show 
answers to the question / would trust this person with sensi- 
tive information as replies to the other two questions exhibit 
similar values. 

Accumulated over all surveyed users, Figure [4] shows the 
frequency distribution of the given ratings for the three in- 
teraction classes most interactions, least interactions, and 
random (cf., Section [3.1.1[ ). The number of communication 
partners with the least interactions and randomly selected 
peers are more evenly distributed across the rating levels. 
Partners that belong to the high interaction class exhibit 
an increased probability for a strong agreement on trust, in 
which lower rating levels are less pronounced. A high num- 
ber of interactions, thus, may indicate higher trust whereas 
less interacting peers cannot be categorized as trustworthy. 
Note that significantly less partners have been rated with a 
trust level of 2 than all other trust levels. This illustrates 
that the subjective level of non-negative trust towards a per- 
son (level 4 or 5) allows for improved differentiation com- 
pared to the level distrust (level 1 or 2). 

4.2 Manual Tags 

Smartphones provide users with the option to manually 



RatingCloseness Rating Trustlnfo 0.7976 
RatingCloseness RatingTrustBest 0.7465 
RatingTrustlnfo RatingTrustBest 0.7834 
All correlations are statistically significant (99%) 

Table 1: Correlations between rating variables 



Rating Level 


Percent 


Cum. Percent 


5 


60.53 


60.53 


4 


19.55 


80.08 


3 


9.02 


89.10 


2 


5.64 


94.74 


1 


5.26 


100.00 



Table 2: Rating levels for contacts tagged as fa- 
vorites 

add tags to their contact lists. Contacts can be marked as 
favorites to appear in a special list or annotated with dedi- 
cated labels (e.g., home, work, or mobile) to group and ease 
lookup. Our analysis show that those labels are not help- 
ful for trust prediction, since they are not widely used (only 
36.6% of the participants tagged at least one number in their 
contact list) and if a number is tagged its mostly labeled 
with mobile (76.2%). The favorite list, however, is a good 
indicator for trust. Out of 188 participants 97 selected at 
least one contact as favorite. On average, every participant 
that applies this feature tags « 4 communication partners. 
Table|2]shows the probability that a favorite contact attains 
a specific rating level. A favorite contact enjoys a trust value 
of 3 or higher with almost 90% probability. This feature al- 
lows for the identification of trustworthy partners with high 
confidence. 

4.3 Relative indicators 

Besides the absolute values examined in Section [4. 1| rela- 
tive variables can be good indicators for trust as well. Useful 
insights can be gained by examining the proportion of the 
number of calls (or messages) per partner to the total num- 
ber of calls (or messages) of a survey participant. Once again 
the lower rating levels do not show a trend (cf, Figure p5j|. 
The fraction of messages of the total number of messages 
increases with the trust levels 4 and 5. For the fraction of 
calls this is only true for level 5. 

Values that do not prove to be good indicators include the 
average call duration and the average message length (cf, 
Section |4.1[ ). Both do not show the expected trend, that 
means more trust does not result in longer average calls. 
This might be due to the fact that especially the number of 
calls to trusted individuals is high, thus leveraging the mean 
average call duration. Investigating temporally relative val- 
ues, e.g., number of calls per hour, neither provides new 
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Figure 4: Distribution of ratings for three different 
interaction classes over all survey participants 

findings. The number of calls and messages do not correlate 
with the duration of the log, so longer logs do not generally 
mean more interaction. Still, the number of interactions per 
time neither serves as a predictor for trust. 

5. APPLICATION PERSPECTIVE 

The results of our survey can be helpful in providing new 
ways of looking at the socio-technical interface especially in 
mobile phone MANETs. 

5.1 Trust Establishment for Smartphones 

In previous work [12] it was shown that trust establish- 
ment between two mobile phones can be based solely on 
social data inherently available on those devices. When two 
phones need to establish a trust relationship they do so on 
basis of the mutual contacts in their contact lists. Finding 
mutual contacts is a variety of the Private Set Intersection 
Problem which has been subject to many publications. As 
soon as the mutual contact list entries are found each of the 
two mobile phones individually examines the results. The 
plain amount of those mutual contacts can give a rough in- 
dication of the relationship between the two phones' users. 
More insights can be gained with a qualitative analysis. 

When two phones try to establish a trust relationship 
amongst each other they can base this decision on the com- 
bined trust towards the contacts they both share. Therefore, 
each phone calculates a trust value to each of the mutual 
contacts. To be functional even without central infrastruc- 
ture the phone can only rely on data inherently available on 
mobile phones. Consequently, call and message logs serve 
as a foundation for trust evaluation. The combination of 
the trust values towards each mutual contact results in a 
trust estimation of the other device. Note, that both de- 
vices perform an individual trust estimation that may result 
in asymmetrical trust estimates. 

5.2 Finding a metric 

The analysis show that several metrics can be derived from 
a mobile phone's call and message logs that can indicate a 
trust relationship to specific communication partners. Trust 
prediction is fairly easy were high absolute (i.e., number of 
calls and messages to and from the partner, cumulated call 
duration and message length) or relative (i.e., fraction of 
calls and messages of the user's total) values can be ob- 
served. The data is not that evident for lower numbers and 
lower trust levels. Whether a survey participant rated a 
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■ Messages per Rating Level 



Rating Level 


Percent 


Cum. Percent 


5 


42.33 


42.33 


4 


19.33 


61.66 


3 


18.90 


80.56 


2 


8.86 


89.42 


1 


10.58 


100.00 



Table 4: Distribution of rating levels for a joined 
OR condition with thresholds of the 95% quantiles 
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Figure 5: Proportion of the total interactions per 
participant 

communication partner with a level of 2 or 3 seems to be a 
very subjective matter. 

A metric to predict trust should take into account all of 
the parameters mentioned. We create a metric by looking at 
the quantiles for each of the parameters where the trust level 
is 1 (cf., Table |3b. Taking, as an example, the 95% quantile 
and number of calls into account, this means that 95% of 
all partners with a rating level of 1 have had less than 13 
calls with the user. Forming a condition joined by the OR- 
operator combining all variables, we can gradually assign 
trust by comparing the different quantiles. If the values 
of the 95% quantiles are chosen as thresholds, more than 
80% of the communication partners, that satisfy at least 
one of the conditions, have a rating level of 3 or more (cf., 
Table [4| . Using higher (or lower) quantiles as thresholds 
will increase (or decrease) this value. Even more accuracy 
can be gained if the conditions are combined pairwise with 
an AND-operator. For example, if Number of calls > 13 
AND Message length > 2105 more than 85% of the re- 
sulting communication partners have a rating level of 3 or 
higher. Excluding all partners that were not tagged as a 
favorite by the user 100% of the results have a trust level of 
3 or higher. 

Concluding it can be seen that trust can be predicted with 
data typically available on every mobile phone. Merely new 
or recently reset mobile phones might not be able to provide 
a minimum data foundation. As shown in the survey the 
mean log duration is at about 110 days, so we are confident 
to find enough useable data in general. 

5.3 Comparison with existing work 

Social data available on mobile phones can be used in or- 
der to predict future callers and callees [To]. Analogously 
to the trust establishment approach in Section [5.2[ the au- 
thors define a metric to rate the closeness between user and 





Quantiles 




Variable name 


75% 


90% 


95% 


99% 


Number of calls 


2.0 


7.0 


13.0 


42.0 


Number of messages 


2.5 


14.0 


35.0 


84.0 


Duration of calls 


89 


460 


711 


4759 


Message length 


158 


1003 


2105 


5580 


Rel number of calls 


0.6 


2.6 


5.5 


14.6 


Rel number of msgs 


0.6 


2.3 


5.9 


17.4 



contact. The authors classify all contacts in one of three 
groups of different closeness by an algorithm based on the 
call activity with each contact. Applying this metric to our 
data set can verify the algorithm or find weaknesses. 

The call prediction metric associates the contacts into 
three groups "Socially Closest", "Socially Near", and "So- 
cially Distant". In order to perform a meaningful compar- 
ison with our survey a mapping to the five point scale we 
used is needed. The character of the mapping obviously in- 
fluences the outcome of the comparison. As a first, intuitive 
mapping a rating of 1 or 2 was assigned to the "Distant" 
group, a rating of 3 was mapped to the "Near", and a rating 
of 4 or 5 represented the "Closest" communication partners. 
Then for every rated communication partner this binning 
was compared to the outcome of the metric. With this bin- 
ning assumption the mean error for all survey participants 
comes to 0.86 meaning that 86% of the communication part- 
ners were evaluated in a lower group than the survey results 
support. Since the authors only used call activity as a ba- 
sis for their metric |10| the figures above only represent this 
original algorithm. Extending the metric to include messag- 
ing activity as well lowers the mean error to 0.77. Choosing 
a different binning also reduces the mean error. When map- 
ping a rating of 5 to the "closest" class, a rating of 4 to the 
"near" class, and a rating of 3 or less to the "distant" class 
the mean error can be lowered to 0.46. Taking also into 
account the messaging activity this can even further be low- 
ered to 0.38, meaning that about 38% of the contacts in a 
contact list were assigned a closeness evaluation of a lower 
group. Thus, the analyzed metric tends to underestimate 
the closeness of a user to its contacts. Furthermore, using 
only call data and not considering messaging activity also 
reduces its accuracy. This might also be due to the fact that 
the authors only surveyed 10 individuals in regard to their 
closeness towards contacts. With the data collected in the 
scope of this work the metric can be improved. 

6. CONCLUSION & OUTLOOK 

In this paper we presented data from a large-scale survey 
of Android users that helps to bridge the gap between data 
that can be found on mobile phones and social relationships 
users of mobiles share. By analyzing more than 200 contact 
lists and interaction logs we identified metrics that can be 
used to evaluate trust towards peers from a user's contact list 
using absolute as well as relative indicators. The results of 
the survey also indicated that closeness and trust are highly 
correlated, thus linking previous research with our findings. 

The findings can help to improve application scenarios 
like trust establishment in mobile ad-hoc networks of mobile 
phones, since they inherit the social data needed. 



Table 3: Quantiles of selected variables for rating 
level 1 
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